Bayesian Nonparametric Multilevel Clustering with Group-Level Contexts

Authors

  • Tien-Vu Nguyen
  • Dinh Q. Phung
  • XuanLong Nguyen
  • Svetha Venkatesh
  • Hung Hai Bui
Abstract

We present a Bayesian nonparametric framework for multilevel clustering which utilizes group-level context information to simultaneously discover low-dimensional structures of the group contents and partition groups into clusters. Using the Dirichlet process as the building block, our model constructs a product base-measure with a nested structure to accommodate content and context observations at multiple levels. The proposed model possesses properties that link the nested Dirichlet processes (nDP) and the Dirichlet process mixture models (DPM) in an interesting way: integrating out all contents results in the DPM over contexts, whereas integrating out group-specific contexts results in the nDP mixture over content variables. We provide a Pólya-urn view of the model and an efficient collapsed Gibbs inference procedure. Extensive experiments on real-world datasets demonstrate the advantage of utilizing context information via our model in both text and image domains.
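As background for the Pólya-urn view mentioned in the abstract, the following is a minimal sketch of the standard Pólya-urn (Chinese restaurant process) scheme for a single, ordinary Dirichlet process. It is not the paper's nested, context-aware model or its collapsed Gibbs sampler. The function name `polya_urn_assignments` and the parameters `alpha` (concentration) and `seed` are illustrative assumptions, not names from the paper.

```python
import random


def polya_urn_assignments(n, alpha, seed=0):
    """Sequentially assign n items to clusters with a plain Polya-urn scheme.

    Illustrative only: a single-level Dirichlet process prior over partitions,
    not the multilevel model described in the abstract.
    """
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    rng = random.Random(seed)
    counts = []       # counts[k] = number of items already in cluster k
    assignments = []  # assignments[i] = cluster label of item i
    for _ in range(n):
        # An existing cluster k is picked with weight counts[k];
        # a brand-new cluster is opened with weight alpha.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)   # open a new cluster
        else:
            counts[k] += 1     # join an existing cluster
        assignments.append(k)
    return assignments


if __name__ == "__main__":
    labels = polya_urn_assignments(20, alpha=1.0, seed=1)
    print(labels)
```

Larger values of `alpha` tend to open more clusters, which is the sense in which the number of clusters is not fixed in advance under a Dirichlet process prior.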

Similar resources

Supplementary Material for Bayesian Nonparametric Multilevel Clustering with Contexts

Vu Nguyen†, Dinh Phung†, XuanLong Nguyen‡, S. Venkatesh†, and Hung Bui∗ †Centre for Pattern Recognition and Data Analytics (PRaDA), Deakin University, Australia. {tvnguye,dinh.phung,svetha.venkatesh}@deakin.edu.au ‡Department of Statistics, Dept of Electrical Engineering and Computer Science, University of Michigan. ∗Laboratory for Natural Language Understanding, Nuance Commun...

Scalable Nonparametric Bayesian Multilevel Clustering

Multilevel clustering problems where the content and contextual information are jointly clustered are ubiquitous in modern datasets. Existing works on this problem are limited to small datasets due to the use of the Gibbs sampler. We address the problem of scaling up multilevel clustering under a Bayesian nonparametric setting, extending the MC2 model proposed in (Nguyen et al., 2014). We groun...

Gender-based Differences in Associations between Attitude and Self-esteem with Smoking Behavior among Adolescents: A Secondary Analysis Applying Bayesian Nonparametric Functional Latent Variable Model

Background: Different patterns of gender-based relationships between attitude toward smoking and self-esteem with smoking behavior have been reported. However, such associations may be much more complex than a simple assumed linear relationship. We aimed to propose a method of providing hand details on the total and gender-based scenarios of the relationships between attitude toward smoking and sel...

Semiparametric Bayesian inference for multilevel repeated measurement data.

We discuss inference for data with repeated measurements at multiple levels. The motivating example is data with blood counts from cancer patients undergoing multiple cycles of chemotherapy, with days nested within cycles. Some inference questions relate to repeated measurements over days within cycle, while other questions are concerned with the dependence across cycles. When the desired infer...

Bayesian Framework for image segmentation Based on Nonparametric Clustering with Spatial Neighborhood Information

In this paper, we present a Bayesian framework for image segmentation based upon spatial nonparametric clustering. To estimate the density function in a nonparametric form, the...

Publication date: 2014